A note on Sattath and Tversky's, Saitou and Nei's, and Studier and Keppler's algorithms for inferring phylogenies from evolutionary distances.

نویسنده

  • O Gascuel
چکیده

Agglomerative algorithms iteratively pick a pair of taxa, create a new node that represents the cluster of these taxa, and compute a new distance matrix with reduced size where both taxa are replaced by this node. The cycle is repeated until the number of taxa becomes three (or two for rooted trees). This general scheme was first applied to additive unrooted trees by Sattath and Tversky (ADDTREE method; 1977) in the context of mathematical psychology. The neighbor-joining (NJ) method of Saitou and Nei ( 1987) widely popularized this approach in the phylogenetic study. This method is based on the minimum evolution principle and provides trees with near-minimal sum of branch-length estimates. An alternative formulation of the NJ method with reduced computational complexity was given by Studier and Keppler (SK method; 1988), while Rzhetsky and Nei ( 1992, 1993) clarified the theoretical foundation of the minimum evolution principle. Several simulations (Saitou and Nei 1987; Nei 199 1) have shown a high relative efficiency of ADDTREE and of the NJ method in recovering the true topology. These studies have also shown that ADDTREE and the NJ method, whose principles seem very different, are in fact close and usually provide identical or similar trees. For example, they obtain the same tree with Case’s ( 1978) data. The explanation for this proximity was given by Saitou and Nei ( 1987) for four taxa. In this note, we account for this proximity regardless of the number of taxa, and we show that the minimum evolution principle, as employed in the NJ method, is very close to the neighborliness used by Sattath and Tversky ( 1977) and by Fitch ( 198 1) in a nonagglomerative way. In the following, we recall the principles of the ADDTREE, NJ, and SK methods and explain why they are so close. We will only be concerned with the construction of the tree shape, and not with branch-lengths estimation. For the latter aspect, we refer the reader to the original papers and to

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neighbor Joining Algorithms for Inferring Phylogenies via LCA Distances

Reconstructing phylogenetic trees efficiently and accurately from distance estimates is an ongoing challenge in computational biology from both practical and theoretical considerations. We study algorithms which are based on a characterization of edge-weighted trees by distances to LCAs (Least Common Ancestors). This characterization enables a direct application of ultrametric reconstruction te...

متن کامل

The neighbor-joining method: a new method for reconstructing phylogenetic trees.

A new method called the neighbor-joining method is proposed for reconstructing phylogenetic trees from evolutionary distance data. The principle of this method is to find pairs of operational taxonomic units (OTUs [= neighbors]) that minimize the total branch length at each stage of clustering of OTUs starting with a starlike tree. The branch lengths as well as the topology of a parsimonious tr...

متن کامل

Building Very Large Neighbour-joining Trees

The neighbour-joining method by Saitou and Nei is a widely used method for phylogenetic reconstruction, made popular by a combination of computational efficiency and reasonable accuracy. With its cubic running time by Studier and Kepler, the method scales to hundreds of species, and while it is usually possible to infer phylogenies with thousands of species, tens or hundreds of thousands of spe...

متن کامل

A Note on Evolutionary Rate Estimation in Bayesian Evolutionary Analysis: Focus on Pathogens

Bayesian evolutionary analysis provide a statistically sound and flexible framework for estimation of evolutionary parameters. In this method, posterior estimates of evolutionary rate (μ) are derived by combining evolutionary information in the data with researcher’s prior knowledge about the true value of μ. Nucleotide sequence samples of fast evolving pathogens that are taken at d...

متن کامل

Using Metaheuristic Algorithms Combined with Clustering Approach to Solve a Sustainable Waste Collection Problem

Sustainability is a monumental issue that should be considered in designing a logistics system. In order to incorporate sustainability concepts in our study, a waste collection problem with economic, environmental, and social objective functions was addressed. The first objective function minimized overall costs of the system, including establishment of depots and treatment facilities. Addressi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Molecular biology and evolution

دوره 11 6  شماره 

صفحات  -

تاریخ انتشار 1994